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TITLE: PRODUCTION OF HETEROLOGOUS POLYPEPTIDES IN YEAST 

5 CROSS REFERENCE TO RELATED APPLICATIONS 

This application claims priority under 35 U.S.C. 119 of Danish application no. PA 
2000 01800 filed on November 30. 2000. and U.S. provisional application no. 60/256.602 
filed on December 19. 2000, the contents of which are fully Incorporated herein by reference. 

10 FIELD OF THE INVENTION 

The present invention relates to a method for production of heterologous polypeptides 
or intermediates thereof in yeast, in particular polypeptides of non-bacterial origin. The inven- 
tion also relates to DMA sequences and vector constructs for use in such method and trans- 
15 formed yeast cells. 

BACKGROUND OF THE INVENTION 

Yeast Is a widely used organism for production of desired proteins or polypeptides. Ex- 
pression of foreign proteins or polypeptides in yeast is disclosed in e.g. EP 163,529. US 
20 4,870.008 and US 5,618,676. 

For commercial application high yields of the expressed polypeptides is important. A 
number of different factors will have an impact on the expression yield including the promoter 
and so-called upstream activating sequences. It is generally accepted that the best result is 
achieved when a promoter that is homologous to the host organism is used for expression of 
25 both native and foreign genes in transformed host cells. Thus for expression of foreign genes 
in yeast a promoter associated with a yeast gene would be preferred. Several promoters 
useful for expression of heterologous proteins in yeast are known. The most commonly used 
yeast promoters include the PGK, TPl, ADH2 , PH05, MFal and GAP promoters. 

There is, however, a constant need for further promoters that have improved efficiency 
30 and the capability of being regulated for use in large scale thus enabling a high level of ex- 
pression yield in commercial production. 

SUMMARY OF THE INVENTION 

The present invention provides a process for expression of heterologous non- 
35 bacterial polypeptides in yeast wherein a yeast promoter associated with the yeast CIT1 
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gene or a functional part or variant of the CIT1 promoter is used for expression of the het- 
erologous gene. 

In its broadest aspect the present invention is related to a process for making a het- 
erologous, non-bacteriat polypeptide in yeast or an intermediate therefore comprising (i) cul- 
turing a yeast strain comprising a polynucleotide sequence encoding the desired polypeptide 
or an intermediate therefore under suitable culture conditions for expression of said polypep- 
tide, wherein the polynucleotide sequence encoding the desired polypeptide or its intermedi- 
ate is under transcriptional control of a CIT1 yeast promoter or a functional part or variant 
thereof, and (ii) isolation of the desired polypeptide or its intermediate. 

In another aspect the present invention it is related to a polynucleotide construct 
comprising a polynucleotide sequence encoding a non-bacterial polypeptide or an intemnedi- 
ate therefore and the CIT1 yeast promoter or a functional part or variant of the CIT1 pro- 
moter. 

In a further aspect, the present invention is related to a vector construct comprising in 
proper reading frame (a) the yeast CIT1 promoter or a functional part or variant thereof, (b) a 
polynucleotide sequence encoding a non-bacterial polypeptide or an intermediate therefore, 
(c) a suitable leader sequences and (d) possible a transcription terminator sequence. 

In a further aspect, the present invention is related to yeast cells being transformed 
with the polynucleotide construct or vector of the invention. 

The yeast promoter CIT1 directs the expression of the yeast citrate synthase gene 
C/r, vide Rosenkrantz et al. Curr Genet (1994) 25:185-195 and Rosenkrantz et al Molecular 
Microbiology (1994) 13(1). 119-131. The C1T1 promoter sequence is comprised in the se- 
quence complementary to the sequence from position 2434 to position 3155 of GenBank ac- 
cession number Z71616. 

In another embodiment of the present invention the CIT1 promoter sequence is con- 
tained in SEQ ID NO:1 and will typically consist of all or part of the nucleotide sequence of 
SEQ ID NO:1. The promoter sequence used in the present invention may furthermore con- 
sist of all or part of the nucleotide sequence from 10 to 722 of SEQ ID NO:1 . In a further em- 
bodiment the promoter will consist of all or part of the nucleotide sequence from position 150 
to 722 of SEQ ID NO:1 and in a still further embodiment the promoter may consist of all or 
part of the nucleotide sequence from position 150 to 530 of SEQ ID NO:1. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows an example of a yeast plasmid called pMT742 (Egel-Mitani et al., Gene, 
73, 1988, 99. 113-120). The plasmid contains an expression cassette comprising an EcoRI - 
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Xba\ fragment inserted into the plasmid between the transcription-promoter (located on a 
Sail - EcoR\ fragment) and the transcription-terminator of the S. cerevisiae TPI gene. 

Fig. 2 shows the 722 nucleotide DNA sequence containing the CIT1 promoter de- 
scribed in example 1 . Above the sequence is shown the nucleotide changes introduced by 
5 oligonucleotides El 02 and El 03. The underiined sequences denote the restriction sites in- 
troduced by these changes. The nucleotide sequence corresponds to the complementary 
sequence from position 2434 to position 3155 of GenBank accession number Z71616. 

DETAILED DESCRIPTION 

10 With "CIT1" yeast promoter is meant the natural or wild type yeast promoter sequence 

disclosed by Rosenkrantz et at. Curr Genet (1994) 25:185-195) and Rosenkrantz et al, 
Molecular Microbiology (1994) 13(1): 119-131) or its complementary sequence. It is well 
known in the art that not all part of a natural promoter. sequence is necessary for efficient 
transcription. Resections may be made at the N-terminal end or at the C-terminal end or at 

15 both the N-terminal and C-terminal ends. Furthermore one or more nucleotides can be 
deleted within the promoter sequence. Also, a minor number of individual nucleotides may be 
changes to other nucleotides, e.g. to introduce a suitable restriction site. Thus "a functional 
part or variant of the CIT1 promoter" in this context is intended to cover such modified 
forms of the natural CIT1 promoter sequence as long as the transcriptional promoter activity 

20 is not adversely affected. A retention of at least 75% of the transcriptional promoter activity 
compared to a similar construct where everything else is the same except for the promoter 
sequence will be preferred. The functional part or variant of the CIT1 promoter should not 
have less than 65% and more typically not less than at least 70 to 95% of the transcriptional 
promoter activity of the unchanged promoter. 

25 In the context of the present invention a variant of the CIT1 promoter will include any 

sequence obtained from the SEQ ID NO:1 by mutation, deletion, substitution, addition and/or 
modifications of one or more nucleotides as long as the promoter variant retains at least 65% 
of its transcriptional promoter activity in the unmodified state. 

Whether a functional part or variant of the CIT1 promoter has retained at least 65% of 

30 its transcriptional promoter activity in the unchanged form can be demonstrated in various 
ways for example by use of a reporter gene which's expression is detectable in a convenient 
way. The promoter and its variants may be preceded by upstream activating sequences 
"UAS" as well known in the art. 

A coding sequence is under "transcriptional control" of a promoter when RNA 

35 polymerase binds to the transcription initiation sequence and transcribes the coding 
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sequence into mRNA terminating at the transcription termination sequence and the mRNA is 
then translated into the polypeptide encoded by the coding sequence. 

By "in proper reading frame" is meant that the individual polynucleotide sequences 
are operably connected in such a way that the coding sequence is under the transcriptional 
control of the yeast recognized promoter and terminator sequences. 

By "yeast-recognized" promoter and terminator sequences is meant regulatory 
sequences that are functional in yeast. 

By "percent sequence identity" is meant percent identity between two nucleotide se- 
quences when aligned and compared nucleotide to nucleotide. Methods for sequence align- 
ment and for determination of the percent identity are well known in the art, see for example 
Smith T.F. and Waterman M.S. (1981) Identification of common molecular subsequences. J. 
Mot. Biol. 147:195-7. 

"POT is the Schizosaccharomyces pombe triose phosphate isomerase gene, and "TP/t" 
is the S. cerevisiae triose phosphate isomerase gene. 

By "insulin precursor" is meant a single-chain polypeptide which by one or more sub- 
sequent chemical and/or enzymatic processes can be converted to a two-chain human insu- 
lin. The single-chain insulin precursor will contain correctly positioned disulphide bridges 
(three) as in human insulin. 

By a "leader" is meant an amino acid sequence consisting of a pre-peptide (the 
signal peptide) and a pro-peptide. 

The term "pro-peptide" means a polypeptide sequence whose function is to allow the 
expressed polypeptide to be directed from the endoplasmic reticulum to the Golgi apparatus 
and further to a secretory vesicle for secretion into the culture medium (i.e. exportation of the 
polypeptide across the cell wall or at least through the cellular membrane into the periplasm ic 
space of the yeast cell). The pro-peptide may be the yeast a-factor pro-peptide, vide US 
4,546,082 and 4,870,008. Altematlvely, the pro-peptide may be a synthetic pro-peptide, 
which is to say a pro-peptide not found in nature. Suitable synthetic pro-peptides are those 
disclosed in US 5,395,922; 5,795,746; 5,162,498 and WO 98/32867. The pro-peptide will 
preferably contain an endopeptidase processing site at the C-terminal end, such as a Lys- 
Arg sequence or any functional analogue thereof. 

The term "signal peptide" is understood to mean a pre-peptide which is present as an 
N-terminal sequence on the precursor form of a protein. The function of the signal peptide is 
to allow the heterologous protein to facilitate translocation into the endoplasmic reticulum. 
The signal peptide is normally cleaved off in the course of this process. The signal peptide 
may be heterologous or homologous to the yeast organism producing the protein. A number 



of signal peptides which may be used with the DNA construct of the invention including yeast 
aspartic protease 3 (YAP3) signal peptide or any functional analogue (Egel-Mltani et al. 
(1990) YEAST 6:127-137) and the a-factor product of the MFa1 gene (Thorner (1981) in The 
Molecular Biology of the Yeast Saccharomvces cerevisiae . Strathem et al., eds., pp 143-180, 
Cold Spring Harbor Laboratory, NY). 

As used herein "a heterologous protein or polypeptide" is one in which the mature 
part of the protein sequence (i.e., other than the signal peptide, leader peptide, spacer pep- 
tide, and the like) is derived from a different species than that of the host cell. Preferably, the 
heterologous protein or polypeptide is a non-bacterial protein or polypeptide. With "a non- 
bacterial protein or polypeptide" is meant a protein or polypeptide not derived from a bac- 
terial cell. 

In one series of embodiments, the heterologous polypeptide is an insulin precursor or 
an insulin precursor analogue. An insulin precursor or insulin precursor analogue is a single- 
chain polypeptide that can be converted into insulin or an insulin analogue by one or more 
subsequent chemical and/or enzymatic processes. The insulin includes, without limitation, 
human insulin. An insulin analogue has one or more mutations, substitutions, deletions and 
or additions of the A and/or B amino acid chains relative to the native insulin sequence. Pref- 
erably, the insulin analogues contain deletions or substitutions of one, two. or three amino 
acids. 

"DesB30" or "B(1-29)" refers to a human insulin B chain sequence lacking the 830 
amino acid residue. "A(1-21)" refers to a native human insulin A chain. "MI3" refers to a sin- 
gle chain insulin precursor in which 8(1-29) is linked to A(1-21) via a short peptide bridge 
(AlaAlaLys). 

In another embodiment the heterologous polypeptide is GLP-1. The amino acid se- 
quence of GLP-1 is given /.a. by Schmidt et al. (Diabetologia 28 704-707 (1985). GLP-1 (7- 
37) and analogues thereof have attracted much attention in recent years. Other examples of 
a heterologous polypeptides are GLP'-2 and glucagon both belonging to the GRF (growth hor- 
mone releasing factor) family of peptides having a His or Tyr in the N-terminal position and Ser, 
Ala or Gly in the next position, vide Adelhorts K. et al.. The Journal of Biological Chemistry 
(1994) p 6275 -6278). 

By "an intermediate" is meant a polypeptide or protein which can be converted into 
the desired final product by suitable chemical or physical means. 

By "connecting peptide" or "C-peptide" is meant the connection moiety of the 8-C-A 
polypeptide sequence of a single chain preproinsulin-like molecule. Specifically, in the natu- 
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ral insulin chain, the C-peptide connects position 30 of the B chain and position 1 of the A 
chain. 

The polynucleotide sequence of the invention may be prepared synthetically by estab- 
lished standard methods, e.g. the phosphoamidite method described by Beaucage et al. 
(1981) Tetrahedron Letters 22:1859-1869, or the method described by Matthes et al. (1984) 
EMBO Journal 3:801-805. According to the phosphoamidite method, oligonucleotides are 
synthesized, for example, in an automatic DNA synthesizer, purified, duplexed and ligated to 
form the synthetic DNA construct A currently preferred way of preparing the DNA construct 
is by polymerase chain reaction (PGR). 

The polynucleotide sequence of the invention may also be of mixed genomic, cDNA, 
and synthetic origin. For example, a genomic or cDNA sequence encoding a leader peptide 
may be joined to a genomic or cDNA sequence encoding the A and B chains, after which the 
DNA sequence may be modified at a site by inserting synthetic oligonucleotides encodjpg the 
desired amino acid sequence for homologous recombination in accordance with well-known 
procedures or preferably generating the desired sequence by PGR using suitable oligonu- 
cleotides. 

The invention encompasses a vector which is capable of replicating in yeast and which 
carries a polynucleotide sequence encoding the desired polypeptide. The recombinant vector 
may be an autonomously replicating vector, i.e., a vector which exists as an extra- 
chromosomal entity, the replication of which is independent of chromosomal replication, e.g., 
a plasmid, an extra-chromosomal element, a mini-chromosome, or an artificial chromosome. 
The vector may contain any means for assuring self-replication. Examples of sequences 
which enable the vector to replicate in yeast are the yeast plasmid 2 ^m replication genes 
REP 1-3 and origin of replication. 

Alternatively, the vector may be one which, when introduced into the host cell, is inte- 
grated into the genome and replicated together with the chromosome(s) into which it has 
been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids 
which together contain the total DNA to be introduced into the genome of the host cell, or a 
transposon may be used. The vector may be linear or closed circular plasmids and will pref- 
erably contain an element(s) that permits stable integration of the vector into the host cell's 
genome or autonomous replication of the vector in the cell independent of the genome. 

The vectors of the present invention preferably contain one or more selectable mariners 
which permit easy selection of transformed cells. A selectable marker is a gene the product 
of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to 
auxotrophs, and the like. Examples of bacterial selectable markers are the dal genes from 



BacHius subtilis or Bacillus licheniformis, or markers which confer antibiotic resistance such 
as ampicillin, kanamycin, chloramphenicol or tetracycline resistance. Suitable markers for 
yeast host cells are ADE2. HIS3. LEU2. LYS2. MET3, TRP1, and URA3. A preferred select- 
able marker for yeast is the Schizosaccharomyces pompe TPI gene (Russell (1985) Gene 
40:125-130). 

In the vector, the polynucleotide sequence is operably connected to the promoter se- 
quence of the invention. The promoter sequence may be preceded by up stream enhancer 
sequences . 

The invention encompasses a recombinant expression vector which is capable of rep- 
licating in yeast and which carries a polynucleotide construct encoding the desired product or 
an intermediate therefore. The recombinant expression vector may be any vector which is 
capable of replicating in yeast organisms. In the vector, the polynucleotide sequence should 
be operably^connected, to the selected promoter sequence. The polynucleotide construct of 
the invention may also be operably connected to a suitable terminator, for example the TPI 
terminator (Alber et al. (1982) J. MoL AppL Genet. 1:419-434). 

The procedures used to ligate the polynucleotide constructs of the invention, the pro- 
moter and the terminator, respectively, and to insert them into suitable yeast vectors contain- 
ing the information necessary for yeast replication, are well known to persons skilled in the 
art. It will be understood that the vector may be constructed either by first preparing a DNA 
construct containing the entire DNA sequence of the invention, and subsequently inserting 
this fragment into a suitable expression vector, or by sequentially inserting DNA fragments 
containing genetic information for the individual elements (such as the signal, leader, pro- 
moter and the desired product) followed by ligation. 

The yeast organism used in the process of the invention may be any suitable yeast or- 
ganism which, on cultivation, produces large amounts of the insulin precursor and insulin 
precursor analogues of the invention. Examples of suitable yeast organisms may be strains 
selected from the yeast species Saccharomyces cerevisiae, Saccharomyces kiuyveri, 
Schizosaccharomyces pombe, Sacchoromyces uvarum, Kluyveromyces lactis, Hansenula 
polymorpha, Pichia pastoris, Pichia methanoiica, Pichia kiuyveri, Yarrowia lipolytica, Candida 
sp., Candida utilis, Candida cacaoi, Geotrichum sp., and Geotrichum fennentans. 

The transformation of the yeast cells may for instance be effected by protoplast 
formation followed by transformation in a manner known per se. The medium used to 
cultivate the cells may be any conventional medium suitable for growing yeast organisms in 
industrial scale. The design and operation of a typical fermentor is described in Bailey & OIlis, 
Biochemical Engineering Fundamentals, p620-26, Sec.ed. 1986, McGraw-Hill, ISBN 0-07- 
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00321 2-2A. The cuKivation may be conducted as a batch culture, i.e. all ingredients added 
before inoculation, as a fed batch culture where a limiting nutrient is added during the 
fermentation or as a continuous culture which means that at a constant feed of complete 
medium is fed to the fermentor and an equal amount of culture medium is removed from the 
fermentor as described in Aiba. Humphey and Millis: Biochemical Engineering. Academic Press 
N.Y., 1965. In one embodiment the cultivation is conducted as a batch culture that is a closed 
culture system which contains an initial, limited amount of nutrients. 

With "industrial or commercial scale" is meant production scale higher than 10 m^, 
preferably between 1 0 and 500 m^. more preferably between 50 and 200 m^ and even more 
preferably between 80 and 1 80 m^. 

The expressed product will be recovered from the medium by conventional 
procedures including separating the yeast cells from the medium by centrifugation or 
filtration, precipitating the proteihaCeous components of the supernatant or filtrate by means 
of a salt, e.g. ammonium sulphate or by ion exchange chromatography, followed by 
purification by a variety of chromatographic procedures, e.g. ion exchange chromatography, 
affinity chromatography, or the like. 

After secretion to the culture medium and recovery, the isolated polypeptide may be 
subjected to various in vitro procedures for example to remove a possible N-terminal exten- 
sion sequence and/or other unwanted sequences in the expressed and secreted product. 
Thus the desired polypeptides, e.g. an insulin precursor or an insulin precursor analogue 
may be expressed with an N-terminal amino acid residue extension, as described in U.S. 
Patent No. 5,395,922. and European Patent No. 765,395A, both of which patents are herein 
specifically incorporated by reference. The N-terminal extension may be removed from the 
recovered insulin precursor or insulin precursor analogue by means of a proteolytic enzyme 
which is specific for a basic amino acid (e.g., Lys) so that the terminal extension is cleaved 
off at the Lys residue. Examples of such proteolytic enzymes are trypsin or Achromobacter 
lyticus protease. If the insulin precursor or insulin precursor analogue comprise a connecting 
peptide such connecting peptide will cleaved off by in vitro enzymatic conversion by means 
of trypsin or a trypsin like enzyme or an Achromobacter lyticus protease. If the insulin pre- 
cursor or insulin precursor analogue is expressed and secreted as single chain precursor 
wherein the B29 amino acid residue is linked to the A1 amino acid residue by means of a 
peptide bond or a connecting peptide such as an AlaAlaLys sequence (MI3) the insulin pre- 
cursor is converted into insulin by enzymatic conversion with trypsin or Achromobacter lyti- 
cus protease in the presence of an L-threonine ester followed by conversion of the threonine 
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ester of the insulin or insulin analogue into insulin or the insulin analogue by basic or acid 
hydrolysis as described in US patent specification No. 4.343,898 or 4,916.212. 

The present invention is described in further detail in the following examples that are 
not in any way intended to limit the scope of the invention as claimed. The attached Figures 
are meant to be considered as integral parts of the specification and description of the inven- 
tion. All references cited are herein specifically incorporated by reference for all that is de- 
scribed therein. 

EXAMPLES 
General Procedures: 

Plasmids and DNA 

All expressions plasmids are of the C-POT type, similar to those described in WO 
EP 171 142, which are characterized by containing the Schizosaccharomyces pombe triose 
phosphate isomerase gene (POT) for the purpose of plasmid selection and stabilization in S. 
cerevisiae. The plasmids furthermore contain the S. cerevisiae triose phosphate isomerase 
terminator. 

Yeast Strain and Transformation 

S. cerevisiae strain MT663 (MATa/MATa pep4-3/pep4-3 HIS4/his4 tpi::LEU2/tpi::LEU2 
Cir*) or S. cerevisiae strain ME1719 (/VMTa/yWyATa Ayap3::URA3/Ayap3::ura3 pep4-3/pep4'3 
Atpi::LEU2/Atpi::LEU2 Ieu2/leu2 URA3/URA3) were used as host strains for transformation. 
Strain MT663 was deposited in the Deutsche Sammlung von Mikroorganismen und Zellkul- 
turen in connection with filing WO 92/11378 and was given the deposit number DSM 6278. 
Strain ME1719 is described in WO 98/01535. Transformation of MT633 and MT1719 was 
conducted as described in WO 97/22706 and WO 98/01535, respectively. 

Yeast Expression 

Yeast strains harbouring plasmids as described above, were grown in YPD media 
(Guthrie , C. & Fink, G.R., Eds., Guide to Yeast Genetics and Molecular Biology, Academic 
Press, 1991). For each strain 2-4 individual 5 ml cultures were shaken at 30'C for 72 hours, 
with a final ODeoo of approximately 15-20. After centrifugation the supernatant was removed for 
quantitative analysis by which method the concentration of secreted heterologous product was 
measured. For GLP-1 the analysis was by HPLC as described in copending patent application 
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DK PA 2001 01141. For insulin precursors the analysis was by HPLC by a method described 
by Snel. L. et al. Chromatographia 24 (1987) 329-332. For SMBP an insulin binding assay was 
perfonned as described by Andersen. A.S. et al.. J. Biol. Chem. 275 (2000) 16948-16953. For 
AMMEP an assay for proteolytical activity was performed as follows: An MI3 precursor with the 
5 extension GluGluAlaGluAlaGluAlaLys(SEQ ID NO:2) (described in Kjeldsen et al.. Gene 170: 
107-112. 1996) was incubated with supernatant from yKV333 or yEA284 overnight at 37°C. 
Subsequently the supernatants were analysed for GluGluAlaGluAlaGluAlaLys(SEQ ID NO:2)- 
extended MI3 or MI3 with a single N-teminal Lys residue (the product that results from AMMEP 
cleavage) by HPLC as described above. The ratio of Lys-MI3/GluGluAlaGluAlaGluAlaLys (SEQ 
10 ID NO:2)-Mt3 was used as a measure for AMMEP expression level. 

in Table 1 the expression levels of a number of insulin precursors, a GLP-1(7- 
37)Lys34Arg analogue as well as Sf-IBP and AMMEP obtained by use of the CIT1 promoter 
are given as percentage of the level obtained with the corresponding constructs using the TPI1 
M promoter in stead of the CIT1 promoter. 

g 

Q Example 1. 

rU Construction of expression vector encodino an insulin precursor M13 under expression control 

CO 

|sy of the CIT1 promoter. 

Cn Figure 1 shows a yeast plasmid called pMT742 (Egel-Mitani et al.. Gene, 73, 1988, 99. 

20 113-120). The plasmid contains an expression cassette comprising an EcoRI - Xba\ frag- 

j,^ ment inserted into the plasmid between the transcription-promoter (located on a Sa/I - EcoRI 

lU fragment) and the transcription-term inator of the 5. cerevisiae TPI gene. 

C3 

Q In plasmid pMT742 the EcoRI - Xba\ fragment encodes a fusion product composed of 

the MFal pre-pro leader, a Lys-Arg cleavage site for the dibasic processing endopeptidase 

25 KEX2, and the single-chain mini-insulin precursor MI3. In order to construct a plasmid where 
the promoter from the TPI gene has been replaced by the promoter from the CIT1 gene of S. 
cerevisiae, the following steps were performed using standard molecular biology techniques 
(e.g. Sambrook, J., Fritsch, E.F. and Maniatis, T., Molecular Cloning: A laboratory Manual, 
Cold Spring Harbour Laboratory Press, New York, 1989). 

30 A 722 bp DNA fragment containing the CIT1 promoter was amplified with PCR from 

genomic S. cerevisiae DNA (from strain MT663) using oligonucleotides El 02 (5'-TAT AAG 
AGA GGT CGA CTT ATT GTT GTC AGC CCA ATG ATT C-3') (SEQ ID No: 3) and El 03 
(5'-TAT CGC TGA CGA ATT CGT AAA TAG TAT TAT ATT GCT ATA TGT-3') (SEQ ID 
No: 4). El 02 has been designed to introduce a Sa/I restriction site, and El 03 an EcoRI restic- 
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tion site, into the amplified DNA fragment. After digestion with Sa/I and HcoRI the CIT1 pro- 
moter can thus be cloned as a Sa/I - EcoRI fragment. 

To facilitate the cloning, the Apal - EcoRX of pMT742 containing the TPI promoter 
from S. cerevisiae was subcloned into a pBluescriptll-SK (Stratagene) Apaf - EcoR\ vector 
5 fragment. The resulting plasmid pEA222 was digested with Sa/1 and EcoRI and the vector 
fragment was ligated to the Sal\/EcoR\ DNA fragment containing the CIT1 promoter. The re- 
sulting plasmid pSurf/CIT was digested with A/col and EcoRI and the DNA fragment contain- 
ing the CIT1 promoter was ligated to the Nco\/Xba\ vector fragment and the EcoRUXbai 
fragment from pMT742 resulting in the final plasmid pEA268. 

10 The expression plasmid was propagated in E. coii, grown in the presence of ampicillin 

and isolated using standard techniques (Sambrook et al., 1989). The plasmid DNA was 
checked for insert by appropriate restriction nucleases (e.g. EcoRI. A/col. Sa/I. Xba\) and was 
shown by sequence analysis to contain the proper sequence of the CIT1 promoter. 

The plasmid pEA268 was transformed into S. cerevisiae strain MT663. Yeast trans- 

15 formants harbouring plasmid pEA268 were selected by glucose utilization as carbon source 
on YPD (1% yeast extract, 2% peptone, 2% glucose) agar (2%) plates and the resulting 
strain was named yEA268. The control strain MT742 is MT633 transformed with pMT742. 

Example 2 

20 Construction of expression vector encoding an N-terminal extended insulin precursor MIS un- 
der expression control of the CIT1 promoter. 

The EcoRI/Xbal DNA fragment containing the expression cassette from plasmid 
pAK729 (described in WO 97/22706) encodes a fusion product composed of a signal peptide 
followed by a synthetic leader, a Lys-Arg cleavage site for the dibasic processing endopepti- 

25 dase KEX2, a spacer and processing site (GluGluAlaGluProLys) (SEQ ID NO:5) and the sin- 
gle-chain mini-insulin precursor MIS. This fragment was ligated to the Nco\IXba\ and the 
Nco\IEcoR\ fragment (containing the CIT1 promoter) from pEA268 resulting in the final plasmid 
pEA274. The plasm ids were propagated in E. co//. isolated and checked for the correct insert 
as in example 1. The plasmlds pAK729 and pEA274 were transformed into S. cerevisiae strain 

30 MT663 and transfomiants selected as described in example 1. The resulting yeast strains were 
named yAK729 and yEA274, respectively. 
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Example 3 

Construction of expression vector encoding an N-terminal extended insulin precursor (B(1-29)- 
AsD-Pro-Lvs-A(1-2in under expression control of the CIT1 promoter. 

Copending patent application DK PA 2001 00547 discloses expression of a fusion pro> 
5 duct comprising a signal peptide followed by a synthetic leader, a LysArg cleavage site for the 
dibasic processing endopeptidase KEX2, a spacer and processing site (GluGluGlyGluGlu- 
ProLys) (SEQ ID N0:6) and an insulin precursor B(1-29)-AspProLys-A(1-21). An EcoRI/Xbal 
fragment containing DNA encoding this fusion product was ligated to the Nco\/Xba\ and the 
Nco\/EcoR\ fragment (containing the CIT1 promoter) from pEA268. The resulting plasmids 
10 were propagated in E. coli, isolated and checked for the correct insert as in example 1 and 
were transformed into S. cerevisiae strain ME1719. Transformants were selected as described 
in example 1 . Yields compared to the control are given in Table 1 . 



Example 4 

S 15 Expression of GLP'1(7-37)Arq34 with an N-terminal extension 

p Copending patent application DK PA 2001 01141 discloses expression of a fusion 

fU product comprising the MFa1 pre-pro leader, a LysArg cleavage site for the dibasic processing 
CO 

|.y endopeptidase KEX2, a spacer and processing site GluGiuAlaHisLys (SEQ ID NO: 7) and a 

Cn GLP-1 analogue wherein Lys in position 34 is substituted with an Arg. An EcoRI/Xbal DNA 

20 fragment containing the DNA ecoding the fusion product was ligated to the ApaUXbal and the 

j,4 Apa\/EcoR\ fragment (containing the CIT1 promoter) from pEA268. The resulting plasmids 

lU were propagated in E. coli, isolated and checked for the correct insert as in example 1 and 
£3 

Q were transformed into S. cerevisiae strain ME1719. Transformants were selected as described 
in example 1 . 



25 



Example 5 

Construction of expression vectors for Sf-IBP containing the TPI or CIT1 promoter 



Spodoptera frugipera (fall armyworm) insulin-like peptide binding protein has been de- 
scribed by Andersen, A.S. et al.. J. Biol. Chem. 275 (2000) 16948-16953 and the DNA se- 
quence for the gene has been deposited in GenBank with accession number AF236641. 

Oligonucleotides were designed ihat allowed PCR amplification of a DNA fragment en- 
oding amino acids 23 to 265 of S^IBP furnished with an N-terminal extension having the 
amino acid sequence EEAEPK, Using a combination of PCR and overlap PCR, followed by 
isolation and cloning by standard/methods (Norton et al., Gene 77:61-68, 1989, Sambrook et 
35 al..1989) an expression vector iiEA263 containing an expression cassette encoding SP-leader- 
KR-Ext-Sf-IBP was obtained/The EcoRI/Nhel fragment from plasmid pEA263 containing the 
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expression cassette.was llgated to the Nco\/Nhe\ ancKthe NcoUEcoRi fragment (containing the 
CIT1 promoter) from pEA268 resulting in the final pfasmid pEA286. 

The plasm ids were propagated in E. coli, isolated and checked for correct insert as in 
example 1. Plasm ids pEA263 and pEA286 were transformed into S. cerevisiae strain MT663 
and transformants were selected as described in example 1 . The resulting yeast strains were 
named yEA263 and yEA286. respectively. 



Example 6 

Construction of expression vector for AMMEP 

The gene for the Armillaria mellea endometallopreptidase has been cloned by Vad. K 
and Thim. L. and the nucleotide sequence has been deposited in GenBank with accession 
number AJ23871 8. Oligonucleotides containing EcoRt and Xba\ restriction sites were designed 
-to PCR-amplify the.entire coding sequence, After isolation and digestion with EcoRI and Xbal, 
the EcoRI/Xba! DNA fragment containing the entire AiVIMEP coding sequence was ligated to 
the Nco\fXba\ vector fragment and the Nco\/EcoR\ fragment containing the TPI promoter (both 
from pMT742) resulting in the final plasmid pKV333. 

The EcoRVXbal fragment from plasmid pKV333 containing the expression cassette for 
AMMEP was ligated to the NcoUXbal and the Nco\/EcoR\ fragment (containing the CIT1 pro- 
moter) from pEA268 resulting in the final plasmid pEA284. 

The plasm ids were propagated in E. coli, isolated and checked for insert as in example 
1. Plasm ids pKV333 and pEA284 were transformed into S. cerevisiae strain MT663 and trans- 
formants selected as in example 1. The resulting yeast strains were named yKV333 and 
yEA284, respectively. 



6210.200-US 
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Table 1 shows the increase in yield by use of the CIT1 promoter compared to the TPI pro- 
5 moter. It appears from the table that use of the CIT1 promoter gives from 50 to more tan 340% 
increase in expressed and secreted product. 
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Table 1 



Heterologous 
Product 


Relative expression level 

compared to control 


MI3 — . ■ 


142% 


MI3 with N-terminal extension SEQ ID NO:5 


180% 


Sf-IBP 


226 % 


AMMEP 


190% 


B(1-29)-AspProLys-A(1-21) with N-terminal 
extension SEQ ID NO:6 


340% 


GLP-1(7-37)Arg 34 with N-terminal 
extension SEQ ID NO:7 


207% 



